2022-05-06

Introduction

-Bioinformatic tools such as AlphaFold use the PDB as a training source for predicting the 3D structures of proteins

-Better predictions for some proteins could be the result of bias toward the types of proteins that have been studied extensively

-The problem: The statistics available at https://www.rcsb.org/stats did not provide the insights we needed to assess this

Re-creation of PDB Data Statistics

Most common entity type

Most common method

Entries added every year
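
The three summaries named above (most common entity type, most common method, entries added every year) are all simple counts over per-entry records. A minimal sketch in Python, assuming a hypothetical list of records: the field names (`entity_type`, `method`, `deposit_year`) and the sample values are invented for illustration, are not real PDB data, and this is not necessarily how the statistics here were produced.

```python
from collections import Counter

# Hypothetical sample records; field names and values are illustrative only.
entries = [
    {"entity_type": "protein", "method": "X-ray diffraction", "deposit_year": 2020},
    {"entity_type": "protein", "method": "X-ray diffraction", "deposit_year": 2021},
    {"entity_type": "protein", "method": "electron microscopy", "deposit_year": 2021},
    {"entity_type": "DNA", "method": "solution NMR", "deposit_year": 2021},
]


def summarize(records):
    """Return (top entity type, top method, entries per year) for the records."""
    entity_counts = Counter(r["entity_type"] for r in records)
    method_counts = Counter(r["method"] for r in records)
    year_counts = Counter(r["deposit_year"] for r in records)
    # most_common(1) yields a single (value, count) pair for the top category.
    top_entity = entity_counts.most_common(1)[0]
    top_method = method_counts.most_common(1)[0]
    # Sort by year so the result reads as a time series.
    per_year = dict(sorted(year_counts.items()))
    return top_entity, top_method, per_year


top_entity, top_method, per_year = summarize(entries)
```

With real data, the same counting would apply to whatever per-entry table is available; only the field names would change.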

## Introduction

The Research Collaboratory for Structural Bioinformatics - Protein Data Bank (RCSB-PDB)

  • An open archive of experimental 3D structures
  • An estimated 1 million unique users annually